UNCLASSIFIED 


ad  411921 


DEFENSE  DOCUMENTATION  CENTER 

FOR 

SCIENTIFIC  AND  TECHNICAL  INFORMATION 

CAMERON  STATION.  ALEXANDRIA,  VIRGINIA 


UNCLASSIFIED 


NOTICE:  When  government  or  other  drawings,  speci¬ 
fications  or  other  data  are  used  for  any  purpose 
other  than  in  connection  with  a  definitely  related 
government  procurement  operation,  the  U.  S. 
Government  thereby  incurs  no  responsibility,  nor  any 
obligation  whatsoever;  and  the  fact  that  the  Govern¬ 
ment  may  have  formulated,  furnished,  or  in  any  way 
supplied  the  said  drawings,  specifications,  or  other 
data  is  not  to  be  regarded  by  implication  or  other¬ 
wise  as  in  any  manner  licensing  the  holder  or  any 
other  person  or  corporation,  or  conveying  any  rights 
or  permission  to  manufacture,  use  or  sell  any 
patented  invention  that  may  in  any  way  be  related 
thereto. 


CQ 

o> 


CQ 

CD 


CD 


fflF  MAXIMUM  NUMBER  OF  DISJOINT 
TATIONS  CONTAINED  IN  A  MATRIX 
OF  ZEROS  AND  ONES 


s  a 

! 


by 

D.  R.  Fulkerson 


CJ 

05 

tH 


i 


OPERATIONS  RESEARCH  CENTER 


INSTITUTE  OF  ENGINEERING  RESEARCH 


^  J  -  </-  f 

ORC  63-9  (RR) 

1  MAY  1963 

4 


i 

fl 


UNIVERSITY  OF  C  A  L  I  F  0  R  N  I  A  -  B  E  R  K  E  L  E  Y 


THE  MAXIMUM  NUMBER  OF  DISJOINT  PERMUTATIONS  CONTAINED 
IN  A  MATRIX  OF  ZEROS  AND  ONES 


by 


* 

D.R.  Fulkerson 
Operations  Research  Center 
University  of  California,  Berkeley 


1  May  1963 


ORC  63-9  (RR) 


This  research  has  been  partially  supported  by  the  Office  of  Naval  Research 
under  Contract  Nonr-222(83)  with  the  University  of  California.  Reproduc¬ 
tion  in  whole  or  in  part  is  permitted  for  any  purpose  of  the  United  States 
Government. 

* 

The  author  is  a  visiting  research  mathematician  from  The  RAND  Corporation, 
Santa  Monica,  California 


THE  MAXIMUM  NUMBER  OF  DISJOINT  PERMUTATIONS  CONTAINED 
IN  A  MATRIX  OF  ZEROS  AND  ONES 

I.  Introduction 

A  well-known  consequence  of  the  Konig  theorem  on  maximum  matchings  and 
minimum  covers  in  bipartite  graphs  ^  or  of  the  P.  Hall  theorem  on  systems 
of  distinct  representatives  for  sets^  asserts  that  an  n  by  n  (0,1)  - 
matrix  A  having  precisely  p  ones  in  each  row  and  column  can  be  written 
as  a  sum  of  p  permutation  matrices: 

(1.1)  A  =  P  +  p  +  .. .  +  p  . 

12  p 

Our  main  objective  is  a  generalization  of  (1.1)  along  the  following  lines. 

Let  A  be  an  arbitrary  m  by  n  (0,1)  -  matrix.  Call  an  m  by  n 
(0,1)  -  matrix  P  a  permutation  matrix  if  PPT  =  I  ,  where  PT  is  the 
transpose  of  P  and  I  is  the  identity  matrix  of  order  m  .  This  definition 
implies  m  <  n  and  we  shall  assume  throughout  that  this  inequality  holds. 

/Vs  in  (l.l),  we  seek  a  decomposition 

(1.2)  A  =  P1  +  P2  +  ...  +  Pp  +  R 

where  each  ,  i  =  l,...,p  ,  is  a  permutation  matrix,  R  is  a  (0,1)  -  matrix, 
and  the  integer  p  is  maximal. 

If  p  is  maximal  in  (l„2),  the  remainder  R  of  course  contains  no 
permutation  matrix.  The  converse  statement  is  false,  however.  For  example 


has  the  decompositions 


In  Section  4  a  formula  for  the  maximum  value  of  p  in  (l„2)  is  derived. 
Letting  7r  =  7T  (a)  denote  this  maximum  value,  it  is  shown  that 


(1*3)  7r(A)  =  min 

A’ 

Here  A1  is  an  e  by  f  minor  of  A  ,  N^(A')  denotes  the  number  of  l's 
in  A'  ,  s(A')  a  e  +  f  -  n  ,  brackets  denote  biggest  integer,  and  the  minimum 
is  understood  to  be  taken  over  all  minors  A'  of  A  with  s(A')  >  0  .  The 
constructive  proof  singles  out  a  critical  minor  A'  of  A  that  yields  the 
minimum  in  (1.3).  In  the  example  above,  critical  minors  are  the  identity  of 
order  two  in  the  lower  right  corner,  the  last  column,  or  the  last  row. 

Using  the  formula  (1,3)  and  a  result  due  to  Haber, ^  the  integer 

(1.4)  7T  -  min  7r(A) 

kina 

can  also  be  evaluated.  In  (1.4)  a  is  the  class  of  all  m  by  n  (0„l)  - 
matrices  that  have  the  same  row  and  column  sums  as  A  „  The  resulting 
formula  for  tt  is  given  in  Section  5. 

2,  Reformulation 

The  problem  of  determining  7r(A)  for  an  m  by  n  (0  1)-  matrix 
A  =  (a^j)  may  be  reformulated  in  the  following  way;  Determine  an 


W1(A')~ 

s(A') 
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m  by  n  (0,1)  -  matrix  X  *>  (x4 satisfying  the  constraints 


(2.1) 


n 


J"1 


(2.2) 


m 


i=sl 


J  =*  1, 


i 


(2.3) 


f 


and  maximizing  p  . 

To  see  this,  first  note  that  if  (1.2)  holds,  then  X  =  Pn  +  Pc  +  . . .  +  p 

1  d  P 

satisfies  (2.1),  (2.2),  (2.3).  On  the  other  hand,  if  X  is  an  m  by  n 
(0,1)  -  matrix  with  row  sums  equal  to  p  and  column  sums  at  most  p  ,  then 
X  is  a  sum  of  p  permutation  matrices.  This  assertion  can  be  proved  in 
various  ways.  For  instance,  a  theorem  of  Mann  and  Ryser ^  concerning  the 
existence  of  a  system  of  distinct  representatives  that  includes  a  prescribed 
set  of  elements  implies  that  such  a  matrix  X  contains  a  permutation  matrix 
having  a  1  in  each  column  of  X  of  sum  p  .  We  may  thus  write 
X  =  P1  +  ^  ’  where  ^las  row  sums  P  ”  1  and  column  sums  at  most  p  -  1  , 

and  apply  the  theorem  to  .  Repeated  applications  produce  the  desired 
decomposition. 

The  construction  described  in  the  next  section  solves  the  maximum  problem 
(2.1),  (2.2),  (2„3)  by  increasing  the  parameter  p  by  one  at  each  major 
cycle  until  the  maximum  value  tt  is  obtained.  Thus  for  each  value  of  p 
encountered  in  the  construction,  there  will  be  a  decomposition  (1.2)  for  A  . 
The  construction  can  he  started  with  p  =  0  ,  X  =  0  ,  although  this  would, 
in  general,  be  relatively  inefficient  from  the  computational  standpoint. 
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3.  A  Construction 


We  may  suppose,  in  describing  the  construction,  that  A  =  X  +  R  ,  where 

each  row  sum  of  the  (0,l)  -  matrix  X  is  either  p  or  p  +  1  ,  each  column 

sum  of  X  is  at  most  p  +  1  ,  and  R  is  a  (0,1 )  -  matrix.  The  aim  of  a 
major  cycle  of  the  construction  is  to  produce  a  decomposition  A  =  X'  +  R' 
with  X'  having  precisely  p  +  1  1's  in  each  row  and  at  most  p  +  l  i's 

in  each  column.  If  successful,  the  process  is  repeated  with  the  new 
decomposition  and  p  replaced  by  p  +  1  .  if  unsuccessful,  7t(a)  =  p  . 

We  distinguish  the  l's  of  X  in  the  matrix  A  and  call  these 

marked  l's"  of  A  ;  other  l's  of  A  are  "unmarked."  A  row  or  column  of 

A  that  contains  less  than  p  +  1  marked  l's  will  be  termed  "short."  Thus 
short  rows  have  p  marked  l's  ,  but  short  columns  may  have  fewer  than  p 
marked  l's  . 

The  basic  routine  in  the  construction  assigns  "marks"  to  certain  rows 
and  columns  of  A  using  the  iterative  procedure  (5.1),  (3.2)  below. 

(3.1)  Mark  all  short  rows  of  A  . 

(3.2)  Repeat  the  following  two  steps  in  order  until  there  are  either  no 
newly  marked  rows  or  no  newly  marked  columns  of  A  .  (The  short 
rows  of  A  are  called  "newly  marked"  after  application  of  (3.1), 
and  any  other  row  or  column  marked  in  the  immediately  preceding 
application  of  (3.2b)  or  (3.2a)  is  called  "newly  marked ." ) 

^a)  For  each  newly  marked  row,  mark  each  unmarked  column 
containing  an  unmarked  1  in  that  row. 

(b)  For  each  newly  marked  column,  mark  each  unmarked  row 
containing  a  marked  1  in  that  column. 

At  the  conclusion  cf  the  row  and  column  marking  process  we  distinguish 
two  cases. 
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CASE  1:  A  short  column  of  A  has  been  marked.  In  this  case  the  marking 
procedure  has  located  a  sequence  of  r  ^1  unmarked  and  r  -  1  marked  l's 
having  the  form 


(3.3) 


Vl 


i2Jl 


~2J2 


irJr  1 

r  r-i 


ij 
r  r 


with  a.  unmarked,  a,  .  marked,  ...  a,  .  unmarked,  and  such  that 
1J1  X2J1  irJr 

row  i^  and  column  are  short.  We  then  interchange  marked  and  unmarked 

l's  in  (3.3).  This  yields  a  new  decomposition  A  =  X*  +  R*  ,  with  X* 

containing  one  more  1  than  X  and  having  row  sums  p  or  p  +  1  ,  column 

sums  at  most  p  +  1  .  With  reference  to  the  new  marking  of  l's  given  by 
* 

X  ,  A  has  one  less  short  row.  The  procedure  (3.1),  (3.2)  is  then  repeated 
with  the  new  set  of  marked  l's  of  A  . 

CASE  2:  No  short  columr  of  A  has  been  marked.  If  A  has  no  marked  rows 
(and  hence  no  short  rows),  replace  p  by  p  +  1  and  repeat  (3.1),  (3.2)  with 
the  new  meaning  for  short  row  and  column.  If  A  has  marked  rows  (and  hence 
short  rows),  the  construction  ends. 

It  is  clear  that  the  process  terminates.  We  shall  show  in  the  next 
section  that  the  terminal  value  of  p  is  7t(a)  . 

4.  A  Formula  for  7t(a) 

To  establish  the  formula  (1.3)  for  7t(a)  ,  we  show  first  that  if  (l.2) 
holds,  and  if  A'  is  a  minor  of  A  with  s(A')  >  0  ,  then 


(4.1) 


P  1 


'^(A*)- 
.  s(A')_ 
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Thus,  assume  (1.2)  and  let  A'  be  an  e  by  f  minor  of  A  with 
s(A')  ■  e  +  f  -  n>0  .  Rearranging  rows  and  columns  of  A  ,  we  may  write 


(4.2) 


A'  * 


*  A" 


X'  * 


*  X" 


R'  * 


*  R" 


It  follows  that 


(4.3) 


p(e  +  f  -  n)  <  N^X’)  -  N^X")  ^N-jA')  , 


verifying  (4.1). 

To  complete  the  proof,  it  suffices  to  show  that  there  is  a  p  corresponding 
to  a  decomposition  (1.2),  and  an  e  by  f  minor  A'  of  A  with  s(A')  >  0  , 
for  which  equality  holds  in  (4.1).  Let  p  be  the  terminal  value  in  the 
construction  of  the  preceding  section,  let  A'  be  the  e  by  f  minor 
corresponding  to  the  terminal  sets  of  marked  rows  and  unmarked  columns,  and 
let  A"  be  the  m  -  e  by  n  -  f  complementary  minor.  By  (3.2a),  A' 
contains  no  unmarked  l's  ;  by  (3.2b),  A"  contains  no  marked  l's  .  Mere- 
over,  by  (3.1),  each  unmarked  row  of  A  contains  p  +  1  marked  l's  ; 
and,  by  the  Case  2  hypothesis,  each  marked  column  of  A  contains  p  +  1 
marked  l's  .  Let  t  be  the  total  number  of  marked  l's  in  A  .  The 
above  remarks  provide  a  count  for  t  : 


t  =  (p  +  l)(m  -  e)  +  (p  +  l)(n  -  f)  +  (A' ) 


Since  the  construction  has  ended,  there  is  at  least  one  row  of  A  having 
p  marked  l's.  Consequently 


(4.5) 


t  <  (p  l)m 
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It  follows  from  (4.4)  and  (4.5)  that 


(4.6)  (p  +  l)(e  +  f  -  n)  >  (A1 ) 

In  particular,  s(A')«e+f-n>0. 

Since  A  has  a  decomposition  (1.2)  for  the  terminal  value  p  ,  we  have 
from  (4.3)  and  (4.6), 

(^•7)  p(e  +  f  -  n)  £  N^A')  <  (p  +  l)(e  +  f  -  n)  . 

This  shows  that  the  terminal  p  is  n(A)  and  establishes  the  formula  (1.3) 
for  tt(a)  . 

In  case  A  is  n  by  n  with  precisely  p  l’s  in  each  row  and  column, 
the  formula  (1.3)  reduces  to  7r(A)  =  p  .  This  may  be  checked  as  follows. 

Let  A'  =  A  in  the  right  side  of  (1.3)  to  get 


NX(A) 

s  (a) 


On  the  other  hand,  if  such  an  A  contained  an  e  by  f  minor  A'  with 
s(A' )  >  0  and 


'H^(A')' 
.  s(A'). 


<  P 


then 


^(A1 )  <  p(e  +  f  -  n) 


But  writing 


we  see  that 


p(e  +  f  -  n)  =  N^A')  -  N^A")  , 

a  contradiction.  Hence  for  such  an  A  ,  the  right  side  of  (1.5)  is  p  . 

5.  A  Formula  for  tc 

Let  -t  denote  the  class  of  all  m  by  n  (0,l)  -  matrices  having  row 

sums 

(5.1)  ri^r2>  ...  >rm>  0 

and  column  sums 


(5.2) 


sl^s2^’*  >  sn^0  . 


The  class  can  also  he  viewed  as  generated  from  an  arbitrary  A  in  at 

[7] 

by  interchanges.  Here  an  interchange  is  a  transformation  on  the  elements 
of  A  that  changes  a  minor  of  type  (a)  te-low  into  one  of  type  (b),  or 
vice  versa,  and  leaves  all  other  elements  fixed.: 


In  this  section  we  describe  a  formula  for 
(5.3)  7T  a  min  tt(a)  , 

A  in# 

which  is  similar  to  those  for  maximal  term  rank,  minimal  term  rank, 

[9]  r  2] 

maximal  and  minimal  trace,  and  minimal  width.  Note  that  the  normali¬ 
zation  assumptions  (5.1)  and  (5.2)  are  no  restriction  in  determining  m  . 
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The  key  additional  result  need' d  in  evaluating  -  has  been  furnished 
by  Haber  in  his  study  of  minimal  term  rank .  ^  *  To  describe  this  result, 
we  begin  with  the  function 

(5.4)  t(e,f)  -  ef  +  (rg+1  +  ...  +  rm)  -  ^  +  ...  +  sf)  , 

(0£e£m,0£f<n)  . 

This  function,  introduced  in  [9],  can  also  be  viewed  in  the  following  manner. 
Let  A  in  &  be  written 


with  A^  of  size  e  by  f  .  Then  t(e,f)  is  the  number  of  O' s  in  A^ 
plus  the  number  of  l's  in  kp : 


(5.6) 


t(e,f)  =  Nq(Al)  +  N1(A2) 


Next  define,  as  in  [3], 


(5.7)  V'(e>f)  -  ^  t  , 

e  1  1  1  m  J 

f<:uc 

(5.8)  0(e,f)  =  mir.  (t..  +  t,.  +  (e-i)(f-j)) 

0  <  i  <  e  1  J 

e  £  k  <'  m 
f  1  i  <  n 

Now  let  H(e,f)  denote  the  maximum  number  of  0’s  that  any  matrix  in  the 
normalized  class  &  can  contain  in  its  leading  e  by  f  minor.  An 
ingenious  argument  in  [3]  shows  that 
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(5.9) 


H(e,f)  .  min  fiK*,f)  ,  0{e,f)) 


From  (5.6),  if  A  in  d  contains  the  maximum  number  of  0's  possible 
in  the  leading  e'  by  f'  minor,  then  A  contains  the  minimum  number 

(5.10)  t(e',f *)  -  H(e',f') 

of  l's  possible  in  the  complementary  e  =  m  -  e'  by  f  =  n  -  f'  minor. 
Thus,  provided  it  can  be  shown  that  the  minimum  number  of  l's  possible  in 
any  e  by  f  minor  of  matrices  in  d  is  achieved  in  the  lower  right  e  by  f 
minor  of  some  A  in  d  ,  it  would  follow  from  (1.5)  and  (5.10)  that 

t(e',f')  -  H(e 1 ,  f '  )~| 
m  -  e'  -  f' 

the  minimum  being  taken  over  e',f:  satisfying 

(5.12)  e'  ■*.  f'  <  m 

The  proviso  above  can  be  established  by  an  interchange  argument.  Let 
e  and  f  be  fixed  and  consider  an  e  by  f  it*  nor  A’  of  A  in  the 
normalized  class  d  .  Let  i  be  a  row  of  A  that  is  not  a  row  of  A'  , 

and  let  i 1  be  a  row  of  A  that  is  a  row  cf  A  ,  with  i '  <  i  .  By 

(5.1),  interchanges  confined  to  rows  i'  and  1  of  A  can  be  applied  in 

such  a  way  that  the  transformed  matrix  B  has  an  e  by  f  miner  B'  with 

the  following  properties:  Columns  of  E'  have  the  3ame  index  set  as  those 
of  A'  ;  rows  of  B'  have  the  same  index  set  as  these  of  A'  ,  except  that 
i  has  replaced  i'  ;  N^(E' )  <  N^A')  .  Repetition  of  this  argument,  first 

on  rows,  then  on  columns,  shows  that  the  e  by  f  minor  with  the  minimum 
number  of  l's  possible  in  the  normalized  class  d  tan  be  assumed  to  be  in 


(5.11) 


7T 


min 
e '  f ' 
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the  lower  right  corner.  Hence  (5.11)  is  valid. 


6. _ Remarks  and  Questions 

The  recursive  procedure  (3.1),  (3.2),  which  was  the  main  tool  in 
establishing  the  formula  (1.3)  for  r( A)  ,  is  a  variation,  suitable  to 
the  problem  at  hand,  of  the  labeling  method  for  constructing  maximal  flows 
in  capacity-constrained  networks. ^  Formula  (1.3)  can  also  be  deduced  from 
known  theorems  concerning  network  flows. 

There  is  an  analogue  for  matrices  over  non-negative  reals.  Specifically, 
suppose  given  an  m  by  n  A  =  (aij)>  aij  >  0  *  and  ask  for  a  non-negative 
matrix  X  =  (x^)  satisfying  (2.1),  (2.2),  (2.3)  with  p  maximal.  If  we 
let  (A1 )  denote  the  sum  of  all  entries  of  A'  ,  s(A')  the  sum  of  the 
dimensions  of  A'  minus  n  ,  then  the  maximum  value  of  p  is  given  by 
dropping  brackets  in  (1.3). 

Returning  to  (0,1)  -  matrices,  other  related  questions  are  suggested. 

For  example,  what  is  the  minimum  value  of  p  in  a  decomposition  (1.2),  for 
a  fixed  matrix  A  ,  assuming  that  the  remainder  R  contains  no  permutation 
matrix?  Or  what  is  the  maximum  value  of  7r(A)  over  the  class  generated  by 
A  ?  Both  of  these  questions  seem  harder  to  answer  than  these  considered 
here. 
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